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technique for ranking documents within a database whose content has a large variation in 

to . 

quality and importance. Another aspect of the present invention is to provide a document 
ranking method that is scalable and can be applied to extremely large databases such as 
the world wide web. Additional aspects of the invention will become apparent in view of 
the following description and associated figures.—. 




Please delete the paragraph at page 4, line 18. 



*0 



Please rejjja*5etoe paragraph beginning at page 4, line 19, with the following 
rewritten paragraph: 



J JU relevance only from the intrinsic content of a document, or from the anchor text of 

Up! 



—One aspect of the present invention is directed to taking advantage of the linked 
structure of a database to assign a rank to each document in the database, where the 
document rank is a measure of the importance of a document. Rather than determining 

At, 

backlinks to the document, a method consistent with the invention determines importance 
from the extrinsic relationships between documents. Intuitively, a document should be 
important (regardless of its content) if it is highly cited by other documents. Not all 
citations, however, are necessarily of equal significance. A citation from an important 
document is more important than a citation from a relatively unimportant document. 
Thus, the importance of a page, and hence the rank assigned to it, should depend not just 
on the number of citations it has, but on the importance of the citing documents as well. 
This implies a recursive definition of rank: the rank of a document is a function of the 
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^ fl ranks of the documents which cite it. The ranks of documents may be calculated by an 

iterative procedure on a linked database.—. 



Please repl^ce'the paragraph beginning at page 5, line 19, with the following: 
—In one aspect of the invention, a computer implemented method is provided for 
scoring linked documents. The method includes identifying a plurality of documents, at 
least some of the documents being linked documents, at least some of the documents 
being linking documents, and at least some of the documents being both linked 
documents and linking documents, each of the linked documents being pointed to by a 
link in one of more of the linking documents; assigning a score to each of the linked 
documents based on scores associated with the one or more linking documents; and 
processing the linked documents according to the assigned scores. 

In accordance with another implementation consistent with the present invention, 

Q 

iy a method for scoring linked documents includes receiving a search query from a user; 

O 

j " identifying a plurality of documents responsive to the search query; locating incoming 

links to the identified documents from corresponding linking documents; assigning a 
score to each of the identified documents based on a number of the incoming links to the 
identified document and an importance of the incoming links; creating a ranked list based 
on the scores of the identified documents; and presenting to the user information about 
the identified documents in an order that is based on the ranked list. 

In accordance with yet another implementation consistent with the present 
invention, a method for organizing linked nodes includes determining first link 
information for a linked node; determining second link information for linking nodes that 
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link to the linked node; and calculating a score for the linked node based on both the first 

link information and the second link information. 

In accordance with a further implementation consistent with the present invention, 

a method of organizing linked documents includes: (a) identifying a first linked 

document; (b) identifying links between linking documents and the first linked 

document; (c) assigning a weight to each of the identified links; (d) determining a score 

for the first linked document based on (i) a number of the identified links between the 

linking documents and the first linked document, and (ii) the weights assigned to each of 

the identified links; (e) repeating steps (a) - (d) for a second linked document; and (f) 

organizing the first and second linked documents based on the determined scores. 

In accordance with another implementation consistent with the present invention, a 
method of organizing documents includes identifying a plurality of linked documents; 
identifying primary linking documents that link to the linked documents; identifying 
secondary linking documents that link to the primary linking documents; assigning a 
score to each of the linked documents based on (i) the number of links between the 
primary linking documents and the linked document and (ii) the number of links between 
the primary linking documents and the secondary linking documents; and organizing the 
linked documents according to the assigned scores. 

Additional aspects, applications and advantages will become apparent in view of 
the following description and associated figures.—. 



Please dejefe'the paragraph beginning at page 5, line 3 1 . 
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At page^j^ffine 17, please a^rthe following paragraph: 



As 

r 



-Fig. 3 is a flowchart of one implementation of the invention.- 



i ess. 



<J1 



Please replace the paragraph beginning at page 6, line 19, with the following 
rewritten paragraph: 



-Although the following detailed description contains many specifics for the 
purposes of illustration, anyone of ordinary skill in the art will appreciate that many 
p variations and alterations to the following details are within the scope of the invention, 
jjg Accordingly, the following embodiments of the invention are set forth without any loss of 
in generality to, and without imposing limitations upon, the claimed invention. For support 
^ in reducing the present invention to practice, the inventor acknowledges Sergey Brin, 

s hst 

j & Scott Hassan, Rajeev Motwani, Alan Steremberg, and Terry Winograd.--. 



, ple^s^ac 



At page 1 1, line 26, please add the following paragraph: 



-Fig. 3 shows one embodiment of a computer implemented method for calculating 
an importance rank for N linked nodes of a linked database. At a step 101, an initial N- 
dimensional vector p 0 is selected. An approximation p n to a steady-state probability po> in 
accordance with the equation p n = A n po is computed at a step 103. Matrix A can be an 
NxN transition probability matrix having elements A[i][j] representing a probability of 
moving from node i to node j. At a step 105, a rank r[k] for node k from a k** 1 component 
of p n is determined.--. 
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Please refrface the paragraph beginning at page 1 1 , line 27, with the following 
rewritten paragraph: 

—In one particular embodiment, a finite number of iterations are performed to 
approximate p^. The initial distribution can be selected to be uniform or non-uniform. A 
uniform distribution would set each component of po equal to 1/N. A non-uniform 
distribution, for example, can divide the initial probability among a few nodes which are 
known a priori to have relatively large importance. This non-uniform distribution 
decreases the number of iterations required to obtain a close approximation to poo and also 
is one way to reduce the effect of artificially inflating relevance by adding unrelated 
terms.--. 




Please replace the paragraph at page 12, line 6, with the following rewritten 
paragraph: 



—In another particular embodiment, the transition matrix A is given by 
A = ot/NJL+(l-a)B, 

where Jl is an NxN matrix consisting of all Is, a is the probability that a surfer will jump 
^ randomly to any one of the N nodes, and B is a matrix whose elements B[i][j] are given 
by 

{ 0 otherwise 

where m is the total number of forward links from node i. The (1-a) factor acts as a 
damping factor that limits the extent to which a document's rank can be inherited by 
children documents. This models the fact that users typically jump to a different place in 
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r - th e. w eb-^teHbdtowing a few links, l ne value ot a is typicallyarotmtH"5%nncluding 
. this dampiji gisimpeftantn^ iterations are used to calculate the rank so thai " 

JherejsjiQ^itifici^^ of rank impoitance-wilhi^ l oo p s-oftfee^wefar"^ 



Please^epace the paragraph beginning at page 12, line 27, with the following 
rewritten paragraph: 

—Consistent with the present invention, there are several ways that this method can 
be adapted or altered for various purposes. As already mentioned above, rather than 
including the random linking probability a equally among all nodes, it can be divided in 
various ways among all the sites by changing the H matrix to another matrix. For 
example, it could be distributed so that a random jump takes the surfer to one of a few 
nodes that have a high importance, and will not take the surfer to any of the other nodes. 
This can be very effective in preventing deceptively tagged documents from receiving 
artificially inflated relevance. Alternatively, the random linking probability could be 
distributed so that random jumps do not happen from high importance nodes, and only 
happen from other nodes. This distribution would model a surfer who is more likely to 
make random jumps from unimportant sites and follow forward links from important 
sites. A modification to avoid drawing unwarranted attention to pages with artificially 
inflated relevance is to ignore local links between documents and only consider links 
between separate domains. Because the links from other sites to the document are not 
directly under the control of a typical web site designer, it is then difficult for the 
designer to artificially inflate the ranking. A simpler approach is to weight links from 



__AJtemativelyrone may set a=0 and only iterate a tew tim"55in^e^cuktioiL^-- 
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pages contained on the same web server less than links from other servers. Also, in 

addition to servers, internet domains and any general measure of the distance between 

links could be used to determine such a weighting.—. ' 



in 



Please replace the paragraph beginning at page 14, line 8, with the following 
rewritten paragraph: 



—Various implementations of the invention have the advantage that the 
convergence is very fast (a few hours using current processors) and it is much less 
expensive than building a full-text index. This speed allows the ranking to be customized 
or personalized for specific users. For example, a user's home page and/or bookmarks 
can be given a large initial importance, and/or a high probability of a random jump 
returning to it. This high rating essentially indicates to the system that the person's 
homepage and/or bookmarks does indeed contain subjects of importance that should be 
highly ranked. This procedure essentially trains the system to recognize pages related to 
the person's interests.-. 



Please replace the paragraph beginning at page 15, line 10, with the following 
rewritten paragraph: 

-Another important application and embodiment of the present invention is 
directed to enhancing the quality of results from web search engines. In this application 
of the present invention, a ranking method according to the invention is integrated into a 
web search engine to produce results far superior to existing methods in quality and 
performance. A search engine employing a ranking method of the present invention 
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provides automation while producing results comparable to a human maintained 

^ / categorized system. In this approach, a web crawler explores the web and creates an 

index of the web content, as well as a directed graph of nodes corresponding to the 

structure of hyperlinks. The nodes of the graph (i.e., pages of the web) are then ranked 

according to importance as described above in connection with various exemplary 

embodiments of the present invention.—. 

/ 

Please replace the paragraph beginning at page 15, line 25, with the following 



£<=a 



rewritten paragraph: 



-The search engine is used to locate documents that match the specified search 
criteria, either by searching full text, or by searching titles only. In addition, the search 
can include the anchor text associated with backlinks to the page. This approach has 
several advantages in this context. First, anchors often provide more accurate 
descriptions of web pages than the pages themselves. Second, anchors may exist for 
images, programs, and other objects that cannot be indexed by a text-based search engine. 
This also makes it possible to return web pages which have not actually been crawled. In 
addition, the engine can compare the search terms with a list of its backlink document 
titles. Thus, even though the text of the document itself may not match the search terms, 
if the document is cited by documents whose titles or backlink anchor text match the 
search terms, the document will be considered a match. In addition to or instead of the 
anchor text, the text in the immediate vicinity of the backlink anchor text can also be 
compared to the search terms in order to improve the search.—. 
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Please replace the paragraph beginning at page 16, line 10, with the following 

rewritten paragraph: 



-Once a set of documents is identified that match the search terms, the list of 
documents is then sorted with high ranking documents first and low ranking documents 
last. The ranking in this case is a function which combines all of the above factors such 
as the objective ranking and textual matching. If desired, the results can be grouped by 
category or site as well.—. 




Please replace the paragraph beginning at page 16, line 18, with the following 
rewritten paragraph: 



-It will be clear to one skilled in the art that the above embodiments may be altered 
in many ways without departing from the scope of the invention. Accordingly, the scope 



s 4 of the invention should be determined by the following claims and their legal 

d 

lU equivalents.—. 



IN THE CLAIMS: 



Please cancel claims 1-8 without prejudice or disclaimer. 
Please add new claims as follows: 



—9. A computer implemented method of scoring a plurality of linked 
documents, comprising: 



^JT° identifying a plurality of documents, at least some of the documents being linked 

documents, at least some of the documents being linking documents, and at least some of 



10 



